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METHOD AND APPARATUS FOR THE EMULATION OF 
HIGH PRECISION FLOATING POINT INSTRUCTIONS 



The subject invention relates generally to the field 
of computers and computer software and, more particularly, 
to an apparatus and method for emulating high precision 
floating point instructions. 

Floating point notation is widely used in digital data 
processing devices, such as microprocessors, to represent 
a much larger range of numbers than can be represented in 
regular binary notation. Various types of floating point 
notations are used. Typically, a floating point number has 
a sign bit (s) , followed by an exponent field (e) and a 
mantissa field. 

Microprocessors typically contain and work together 
with floating point units (FPU) to perform operations, 
such as addition and subtraction, on floating point 
numbers. FPUs have the ability to support complex 
numerical and scientific calculations on data in floating 
point format. In order to add or siibtract floating point 
numbers, the decimal points must be aligned. The process 
is equivalent to addition or subtraction of base ten 
numbers in scientific notation. Generally, the FPU 
performs operations on the exponents and mantissas of the 
values in order to align the decimal points. 

Once the decimal points are aligned, the mantissas can 
be added or subtracted in accordance with the sign bits. 
The result may need to be normalized, or left shifted, so"' 
that a one is in the most significant bit position of the 
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mantissa. The result may also be rounded. Many different 
representations can be used for the mantissa and exponent 
themselves, where IEEE standard 754, entitled "IEEE 
Standard for Binary Floating point Arithmetic (ANSI/IEEE 
5 Std 754-1985) , provides a standard used by many CPUs and 
FPUs which defines formats for representing floating point 
numbers, representations of special values (e.g., 
infinity, very small values, NaN) , exceptions, rounding 
modes, and a set of floating point operations that will 
10 work identically on any conforming system. The IEEE 754 
. Standard further specifies the formats for representing 
floating point values with single-precision (32-bit), 
double -precision (64-bit), single -extended precision (up 
to 80-bits) , and doiible-extended precision (128-bit) . 

Most microprocessors also typically include integer 
units for performing integer operations. An integer unit 
is typically provided to perform integer operations, sjich 
as addition and subtraction. While integer units are 
20 common in microprocessors, floating point arithmetic 
performed using integer operations is much more costly 
than the equivalent floating point operations. Thus, most 
microprocessor utilize a combination of FPUs and integer 
xinits to perform necessary calculations. The precision 
25 capable of being achieved in such calculations is 
determined by the actual architecture of the FPU and 
integer unit hardware associated with the microprocessor. 

Across the embedded and non-embedded CPU market, one 
30 finds predominant Instruction Set Architectures (ISAs) for 
which large bodies of software exist that could be 
"Accelerated" for performance, or "Translated" to a myriad 
of capable processors that could present better 



cost/performance benefits, provided that they could 
transparently access the relevant software. One also finds 
dominant CPU architectures that are locked in time to 
their ISA, and cannot evolve in performance or market 
reach and would benefit from ''Synthetic CPU" 
co-architecture . 

It is often desired to run program code written for a 
computer processor of a first type (a '^subject" processor) 
on a processor of a second type (a ''target" processor) . 
Here, an emulator or translator is used to perform program 
code translation, such that the subject program is able to 
run on the target processor. The emulator provides a 
virtual environment, as if the subject program were 
running natively on a subject processor, by emulating the 
subject processor. The precision of the calculations 
which can be performed on the values of the subject 
program have conventionally been limited by the hardware 
architecture of the target processor. 

According to the present invention there is provided 
an apparatus and method as set forth in the appended 
claims- Preferred features of the invention will be 
apparent from the dependent claims, and the description 
which follows. 

The following is a summary of various aspects and 
advantages realizable according to various embodiments of 
the improved architecture for program code conversion 

according to the present invention. It is provided as an 
introduction to assist those skilled in the art to more 
rapidly assimilate the detailed discussion of the 
invention that ensues and does not and is not intended in 
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any way to limit the scope of the claims that are appended 
hereto. 

In particular, the inventors have developed an 
5 improved method and apparatus for expediting program code 
conversion, . particularly useful in connection with an 
emulator which emulates subject program code on a target 
machine where the subject machine base operands possess a 
different precision than the target machine. More 

10 particularly, a high precision floating point emulator is 
provided for the emulation of subject program code 
instructions having a higher precision than that supported 
by the target machine architecture by utilizing 
intermediate calculations having values with a higher 

15 precision than that supported by the target machine. 

The present invention, both as to its organization and 
manner of operation, together with further advantages, may 
best be understood by reference to the following 
20 description, taken in connection with the accompanying 
drawings in which the reference numerals designate like 
parts throughout the figures thereof and wherein: - 

Figure 1 shows a computing environment including 
25 subject and target processor architectures; and 

Figure 2 is an operational flow diagram that describes 
an example of the high precision floating point emulation 
performed in accordance with a preferred embodiment of the 
30 present invention. 

The following description is provided to enable any 
person skilled in the art to make and use the invention 
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and sets forth the best modes contemplated by the 
inventors of carrying out their invention. Various 
modifications, however, will remain readily apparent to 
those skilled in the art, since the general principles of 
5 the present invention have been defined herein 
specifically to provide an improved high precision 
floating point emulation apparatus . 

Referring to Figure 1, an example computing 
10 environment is shown including a subject computing 
environment 1 (''subject machine 1") and a target computing 
environment 2 (''target machine 2"). In the subject 
machine 1, subject code 10 is executable natively on a 
subject processor 12. The subject processor 12 includes a 
15 set of subject registers 1-4, Here, the subject code 10 may 
be represented in any suitable language with intermediate 
layers (e.g., compilers) between the subject code 10 and 
the subject processor 12, as will be familiar to a person 
skilled in the art. 

20 

It is desired in some situations to run the subject 
code 10 on the target machine 2 of the present invention, 
which includes a target processor 22 using a set of target 
registers 24. The two processors 12 and 22 of the subject 

25 machine 1 and the target machine 2, respectively, may be 
inherently non- compatible, such that these two processors 
12 and 24 use different instruction sets. The target 
processor 22 includes a floating point unit 28 for 
computing floating point operations and an integer unit 2 6 

30 for performing integer operations. The floating point 
unit 2 8 and the integer unit 2 6 may comprise any of a wide 
variety of types of hardware units, as known to those 
skilled in the art, where the floating point unit 28 is 
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preferably IEEE 754 Standard compatible floating point 
hardware . 



The two processors 12 and 22 may operate with 
different levels of accuracy and precision depending upon 
their particular architectures as well as the hardware 
designs of their respective floating point unit 2 8 and the 
integer unit 26. Hence, a floating point emulator 20 is 
provided in the target machine 2, in order to emulate high 
precision instructions from the subject code 10 in the 
target computing environment 2. High precision refers to 
a level of precision which is higher than that provided by 
the target machine 2, where the base operands in 
instructions of the subject program code have a higher 
15 precision than that supported by the target machine 2. 
The floating point emulator 2 0 provides for a higher level 
of precision during calculations than the target 
architecture 2 could otherwise provide, thus providing a 
higher level of accuracy in the emulated instructions. 
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20 



25 



The floating point emulator 20 is preferably a 
software component, i.e., a compiled version of the source 
code implementing the emulator, run in conjunction with an 
operating system running on the target processor 22, 
typically a microprocessor or other suitable processing 
device. it will be appreciated that the structure 
illustrated in FIG. 1 is exemplary only and that, for 
example, software, methods and processes according to the 
invention may be implemented in code residing within or 
30 beneath an operating system. 

Referring now to Figure 2, an operational flow diagram 
of a method of performing high precision floating point 
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emulation in accordance with a preferred embodiment of the 
present invention is illustrated. The high precision 
floating point emulation algorithm described hereafter 
refers to a single embodiment of the invention that 
5 provides for the emulation of high precision floating 
point accumulated instructions, while it is understood 
that the floating point emulator 20 is capable of 
emulating any types of instructions where the subject 
machine 1 base operands are at a different precision than 

10 the target machine 2. For example, the floating point 
emulator 2 0 would allow the emulation of the addition of 
two double precision values, such as a subject machine's 
Double (x) + Double (y) operation, on a target machine 2 
that only supports single precision floating point 

15 operations. With this understanding and for ease of 
discussion, the high precision floating point emulation 
algorithm will be described hereafter with reference to 
the emulation of high precision floating point accumulated 
instructions of the form: 

20 

d = ± (a*b ± c) 

where a, b, c and d are operands which can be expressed as 
floating point numbers . High precision as referred to in 

25 this description means any precision which is higher than 
that provided by the target machine 2. For instance, if 
the architecture pf the target machine 2 supports IEEE 
Standard .754 double-precision floating point values, then 
high precision would refer to any values having a higher 

30 precision than double-precision floating point values. It 
should be noted that the floating point emulator 2 0 only 
calculates the intermediate values of the accumulated 
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instructions at high precision, and the operands 
themselves and the result are not at high precision. 

The high precision floating point algorithm embodied 
in FIG. 2 utilizes standard integer techniques to perform 
the calculation of the accumulated instructions in stages, 
where, at the end of each stage, the intermediate values 
are tested at runtime to ascertain whether the 
intermediate values have reached a point at which the 
hardware (i.e., the target processor 22, integer unit 26, 
and floating point unit 28) of the target machine 2 has 
enough precision to finish the calculation without loss of 
accuracy. To achieve this, the high precision floating 
point algorithm performed by the floating point emulator 
2 0 works in combination with an integer unit 26 and an 
IEEE Standard 754 compatible floating point hardware unit 
28. 

A wide number of integer techniques for performing 
floating point emulation are known to those skilled in the 
art, where the high precision floating point algorithm of 
the preferred embodiment described herein accelerates the 
process by using floating point hardware, namely floating 
point unit 28. These process accelerations are referred 
to as fast exit points hereinafter, because they exit the 
testing routine performed at runtime to determine if the 
intermediate values are at a level such that the target 
architecture 2 has enough precision to finish the 
calculation without loss of accuracy and the hardware of 
the 'target architecture 2 is immediately used to perform 
the necessary calculations. 
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Fast Exit Points 

The floating point emulator 2 0 begins the high 
precision floating point algorithm when a floating point 
accumulated instructions of the form: d = ± (a*b ± c) is 
encountered in step 200. It is determined in step 202 if 
any of the three input operands (a, b, c) can be 
considered a special value, where special values include 
zero, infinity or NAN (not a number) . For each of these 
special values, there is a known result, such as dictated 
by the IEEE Standard 754, that all compatible hardware 
will produce regardless of the level of precision and 
hence there is no need for expensive integer emulation. 
Thus, there is a fast exit point for any operands 
identified as special values to perform the calculation 
using the target architecture's floating point unit 28 in 
step 204 . 

If none of the operands (a, b. c) are special values, 
it is next determined in step 2 06 whether the exponent for 
the result of the multiplication (a*b) overlaps with the 
exponent of operand c. Two values will overlap if the 
addition/ subtract ion of the significant digits of the two 
values yields a result different from each of the two 
values. In this context, non- over lapping refers to the 
fact that either a*b or c is so large as to make the other 
insignificant. By way of example, in the situation where 
a particular FPU is only capable of representing 3 
significant digits, if the value 3.10 is added to the 
value 0.01, then it can be seen that both values are 
important to the result, i.e., performing the addition 
will yield a result different to the sources and the 
sources thus overlap. Contrarily, for the same FPU only 



capable of representing 3 significant digits, if the value 
310 is added to the value 0.01, the result when using 3 
significant figures is 310. Thus, in this situation the 
result is the same as the first source and the two values 
did not overlap 

When the two values, fail to overlap, the addition of 
the values is not required. Thus, if the exponent for the 
result of the multiplication (a*b) does not overlap with 
the exponent of operand c, then the floating point 
algorithm determines that the addition/ subtraction is not 
required and another fast exit point is provided where the 
calculation can be performed using the target machine 2's 
FPU 28 in step 204. It should be noted that thresholds 
for determining whether two values overlap can either be 
variably selected or can be thresholds established by 
formats well-known to those skilled in the art can be 
utilized, such as the IEEE Standard 754 floating point 
double-precision format . 

When the exponents of the result of (a*b) and c do 
overlap, the mantissa for the result of the multiplication 
(a*b) is calculated in step 208 in order to establish how 
much precision is required by the mantissa. It is 
determined in step 210 whether the result of the 
multiplication (a*b) requires more mantissa bits than is 
provided by the FPU 2 8 of the target machine 2. For 
example, when the FPU 28 is capable of handling double- 
precision numbers, it is known that operands containing 52 
mantissa bits are utilized for double-precision values. 
If the number of bits required by the mantissa {a*b) is 
less than or equal to the number of mantissa bits capable 
of being handled by the FPU 28 (e.g., 52 bits in the case 
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of double-precision nurabers) , then the FPU 2 8 has 
sufficient precision to perform the calculation. Thus, if 
the mantissa {a*b) requires no more mantissa bits than are 
provided for by the FPU 28, another fast exit point is 
5 provided and the result is calculated using the target 
architecture's FPU 28 in step 204. In the determination 
made in step 210, the precision is determined by comparing 
the spread of the mantissa, namely the number of bit 
positions between the most and least significant set bits. 

10 

When the mantissa (a*b) requires more bits, than 
provided by the FPU 28, the full calculation of a*b is 
completed using the integer unit 26 in step 212. At this 
point, the high precision floating point algorithm 

15 calculates the ±mantissa (a*b) ± mantissa (c) in step 214 . 
A determination is made in step 216 whether the resulting 
mantissa is equal to zero, where upon another fast exit 
point can be created to use the target architecture 2 ' s 
most efficient mechanism, namely the FPU 28, to set the 

20 final result to 0.0 in step 218. Thus, this fast exit 
point is only valid when the mantissa (a*b) and the 
mantissa (c) are subtracted from one another to yield a 
resulting mantissa equal to zero. This removes the need 
to calculate the final exponent and also bypasses any 

25 expensive rounding. However, if the final resulting 
mantissa is not equal to zero, then the remaining parts of 
the calculation of a*b + c must be calculated using the 
integer unit 26 in step 220. 

30 The various fast exit point provided in the high 

precision floating point algorithm described above provide 
for faster and more efficient emulation of accumulated 
instructions by maximizing the use of the FPU 2 8 for 



performing floating point arithmetic. By implementing the 
high precision floating point algorithm of the preferred 
embodiment, accumulated instructions are calculated at a 
higher precision than the operands are typically capable 
of being handled by the architecture of the target machine 
2, resulting in accumulated instructions effectively being 
calculated with greater precision. If the intermediate 
result is twice the precision of the sources, then the 
accumulated instructions can be calculated to an infinite 
precision (i.e., no loss of accuracy). 

For the purposes of illustrating the steps performed 
by the floating point emulator 20 in implementing the 
above-described high precision floating point algorithm, 
the following example is provided without any intention by 
the inventors of the present invention to limit the scope 
of their invention to the described example. 

This example utilizes an accumulated instruction 
having three IEEE Standard 754 double precision floating 
point values for operands (a, b, c) . The IEEE Standard 
754 standard dictates that a double-precision floating 
point value is 64 bits wide, including 1 sign bit, an 11- 
bit exponent, and a 52-bit mantissa) . The example uses 
the definitions in the following legend: 



the input operands 

the intermediate operands 

the sign part of the operand x, represented as a Boolean value 

the exponent part of operand x, represented as an integer 

the mantissa part of the .operands x including the implied one, 
represented as a larger integer 

calculate x using the targets floating point xmit 28 and exit. 
These are indicative of a fast exit point. 



Legend 
a, b, c 
a*b, a*b-c 
sign(x) 
exp(x) 
man(x) 

FPU(x) 



sub(x,y) 
mul(x,y) 
shift__right(x,y) 



x-y 



x*y 

Shift X y places to the right 



In this example, a large integer means larger than 
that provided by the hardware of the target architecture. 
The last three operations in the legend are represented as 
functions as they operate on large integers. 

The pseudo-code for the high precision floating point 
algorithm for the accumulated instruction fmsub (i.e., a * 
b - c) is as follows. Initially, it is determined if any 
of the operands are special to apply an early exit: 



When the operands are not special, it must then be 
determine whether to apply a different early exit by 
determining if the subtraction operation is significant, 
i.e., it has an effect on the final result within the 
required emulation accuracy or SMA (Subject Machine 
Accuracy) . 

Considering (a*b) - c, the subtraction would not be 
significant for the following two cases: 

1. (a*b) - c == (a*b) : SMA 

2. (a*b) - c == -c : SMA 



If (a or b or c) = (±infinity or 0.0 or NAN) 
FPU(a*b - c) 



Endlf 
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The particular emulation must be taken into account to 
determine whether accuracy of the SMA is similar to that 
of the Target Machine Accuracy (TMA) . For instance, for 
the PPC-P4 emulation, it follows that if the subtract 
operation is insignificant in SMA then it is also 
insignificant in TMA. 

A quick and efficient way to test for significance is 
to test for the mere possibility of the subtract operation 
affecting the result within the greater SMA. If it is 
determined that the result does not change within SMA, the 
calculation can be performed natively in TMA without 
precision loss. However, if there is a chance that the 
subtract operation could change the result in a change in 
15 SMA, the emulation performed by the high precision 
floating point algorithm continues. 

For subtraction, the operand with the lower exponent 
is first shifted to make its exponent the same as the 
larger exponent. The mantissas are then subtracted and the 
exponent remains the same. For the operand with the lower 
exponent, the initial exponent shift upwards results in 
the mantissa being shifted downwards. If this results in a 
zero mantissa then the subtract operation is not 
significant. A zero mantissa will be produced if the 
es^onent needs to be raised more than the number of bits 
of accuracy in the mantissa=. Thus, 
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if ((higherExp - lowerExp) > Mantissa Bits) 
Subtraction not significant 

else 

5 Subtraction possibly significant 

fi 



Considering the PPC instruction, fmsiib, (a*b) - c, the 
10 intermediate result of the multiply is accurate to 106 
mantissa bits and the final result of the subtraction is 
accurate to 53 mantissa bits. Therefore a*b has a maximum 
106 mantissa bits and c has a maximum 53 mantissa bits. 
For PPC, the above pseudo code now becomes: 
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if(exp(a*b) > exp(c)) 

if(exp(a*b) - exp(c) > 53) 

// Subtraction not significant, take fast exit 
5 FPU(a*b -c) 

else 

// Subtraction possibly significant, continue emulation 

fi 

else // exp(a*b) <= exp(c) 
0 ififexrjfc) - '-^'^(^ > i o^^ 

II Subtraction not significant, take fast exit 
FPU(a*b -c) 

else 

// Subtraction possibly significant, continue emulation 

5 fi 
fi 



The calculation^ of exp(a*b) involves the quick 
addition, exp(a) + exp(b). This fast exit check can 
therefore be done before the expensive SMA multiplication 
of a and b. The mantissa of a*b is then calculated: 



man(a*b) = mul(man(a), man(b)) 



It is known that a double-precision floating point 
value has 52 bits for its mantissa (plus the implied 1) , 
thus man(x) it 53 bits wide. The result'^ of the 

multiplication will therefore be a maximum of 106 bits 
wide. It is then determined if the extra precision is 
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required by examining the spread of the resulting 
mantissa. If this mantissa would fit within a float 
double (i.e., 53bits including the implied one), then the 
extra precision is not required. This is tested by 
checking to see if the bottom 53bits of the resulting 
mantissa were used. 



If ((man(a*b) & OxlffFfifffffFff) = 0) 
10 FPU(a*b-c) 
Endlf 

exp(a*b) = exp(a) + exp(b) 
sign(a*b) = sign(a) xor sign(b) 
15 



It is now necessary to align a*b and c, in order to 
perform the subtraction. 
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If(exp(a*b)>exp(c)) 

shift_right(man(c), exp(a*b) - exp(c)) 
exp(a*b-c) = exp(a*b) 

5 Else 

shift_right(man(a*b), exp(c) - exp(a*b)) 
exp(a*b-c) = exp(c) 

Endlf 

' If fman''!"*'^'' > mor,r^^\ 

sub(man(a*b), man(c)) 
sign(a*b-c) = sign(a*b) 

Else 

sub(man(c), man(a*b)) 
' sign(a*b-c) = sign(c) 

Endlf 



The resulting mantissa is then checked to see if it 
equals zero: 



If (man(a*b-c) = 0) 
FPU(0.0) 

Endlf 



At this point, emulation has either exited via a fast 
exit point or it has been determined that the full 
precision is required. The result sign, exponent and 
mantissa have all been calculated, where the only 
operation remaining is to convert the result into the 
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subject machine's floating point format, which involves 
aligning the result and rounding. 

As can be seen from the foregoing, an emulator 
5 described in the various embodiments above provide for the 
high precision emulation of subject program code on a 
target machine where the subject machine base operands 
possess a different precision than the target machine. 
Moreover, the emulation of subject program code 
10 instructions having a higher precision than that supported 
by the target machine architecture is provided by- 
utilizing intermediate calculations having values with a 
higher precision than that supported by the target 
machine . 

15 

The different structures of the high precision 
floating point emulation apparatus and method of the 
present invention are described separately in each of the 
above embodim.ents . However, it is the full intention of 

20 the inventors of the present invention that the separate 
aspects of each embodiment described herein may be 
combined with the other embodiments described herein. 
Those skilled in the art will appreciate that various 
adaptations and modifications of the just described 

25 preferred embodiments can be configured without departing 
from the scope and spirit of the invention. Therefore, it 
is to be understood that, within the scope of the appended 
claims, the invention may be practiced other than as 
specifically described herein. 

30 

Although a few preferred embodiments have been shown 
and described, it will be appreciated by those skilled in. 
the art that various changes and modifications might be 
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made without departing from the scope of the invention/ as 
defined in the appended claims. 

Attention is directed to all papers and documents 
which are filed concurrently with or previous to this 
specification in connection with this application and 
which are open, to public inspection with this 
specification, and the contents of all such papers and 
documents are incorporated herein by reference. 



All of the features disclosed in this specification 
(including any accompanying claims, abstract and 
drawings) , and/or all of the steps of any method or 
process so disclosed, may be combined in any combination, 
15 except combinations where at least some of such features 
and/or steps are mutually exclusive. 

Each feature disclosed in this specification 
(including any accompanying claim.s, abstract and drawings) 
may be replaced by alternative features serving the same, 
equivalent or similar purpose, unless expressly stated 
otherwise. Thus, unless expressly stated otherwise, each 
feature disclosed is one example only of a generic series 
of equivalent or similar features. 



The invention is not restricted to the details of the 
foregoing embodiment (s) . The invention extends to any 
novel one, ^ or any novel combination, of ,the features 
disclosed in this specification (including any 
accompanying claims, abstract and drawings), or to any 
novel one, or any novel combination, of the steps of any 
method or process so disclosed. 



Claims 



1. A method of performing high precision emulation of 
program code instructions for a subject machine on a 
target machine, comprising: 

determining if operands in instructions of the program 
code for the subject machine require a different precision 
than provided for by the target machine; and 

applying a floating point emulation algorithm to 
perform intermediate calculations on the operands of the 
instructions at a higher precision than the precision 
supported by the target machine . 

2. The method of claim 1, wherein the target machine 
includes floating point hardware and integer hardware, 
wherein said floating point emulation algorithm comprises: 

utilizing floating point hardware on the target 
machine to perform calculations on the operands of the 
instructions when it is determined based upon the 
intermediate calculations that the target machine provides 
sufficient precision for the calculations required by the 
instructions; and 

utilizing integer hardware on the target machine to 
perform calculations not selected to be performed by the 
floating point hardware. 

3. The method of claim 2, wherein the program code 
instructions are accumulated instructions that are 
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calculated at a higher precision than the operands capable 
of being handled by the target machine. 

4. The method of claim 2, wherein the program code 

instructions are floating point accumulated instructions 
of the form: d = ±(a*b ± c) , wherein a, b, c and d are 
operands expressible as floating point numbers. 

5- The method of claim 4, further comprising 

identifying whether any of the operands (a, b, or c) are 
special values having a known result that all compatible 
hardware will produce regardless of the level of precision 
of said hardware . 

6. The method of claim 5, wherein said special values 
include either zero, infinity, or NaN (not a number) , 
wherein the floating point hardware is utilized to 
calculate the result of the accumulated .instructions when 
any of the operands (a, b, or c) are identified as special 
values . 

7. The method of claim 4, wherein said floating point 
emulation algorithm further comprises: 

determining whether the exponent for the result of the 
multiplication of (a*b) overlaps with the exponent of c; 
and 

Utilizing the floating point hardware to calculate the 
result ■ of the accumulated instructions when the exponent 
for the result of the multiplication of (a*b) fails to 
overlap with the exponent of c . 



8. The method of claim 1, wherein, when the exponent 
for the result of the multiplication of (a*b) overlaps 
with the exponent of c, said floating point emulation 
algorithm further comprising: 

determining whether the mantissa for the result of the 
multiplication (a*b) requires more mantissa bits than 
provided for by said floating point hardware; and 

utilizing the floating point hardware to calculate the 
result of the accumulated instructions when the result of 
the multiplication {a*b) ' does not require more mantissa 
bits than provided for by the floating point hardware. 

9. The method of claim 8, said floating point 
emulation algorithm further comprising computing the full 
calculation of a*b using the integer hardware when 
mantissa for the result of the multiplication (a*b) 
requires more mantissa bits than provided for by the 
floating point hardware . 

10. The method of claim 9, said floating point 
emulation algorithm further comprising: 

determining whether the final resulting mantissa of 
the mantissa (a*b) - the mantissa (c) equals zero; 

utilizing the floating point hardware to make the 
result equal to zero when the resulting mantissa is equal 
to zero; and 



24 



calculating the remaining parts of the calculation of 
a*b + c using the integer hardware when the final 
resulting mantissa is not equal to zero. 

11. A computer -readable storage medium having software 
resident thereon in the form of computer -readable code 
executable by a computer to perform the following steps in 
the high precision emulation of program code instructions 
for a subject machine on a target machine: 

determining if operands in instructions of the program 
code for the subject machine require a different precision 
than provided for by the target machine; and 

applying a floating point emulation algorithm to 
perform intermediate calculations on the operands of the 
instructions at a higher precision than the precision 
supported by the target machine . 

12. The computer-readable storage medium of claim 11, 
wherein the target machine includes floating point 
hardware and integer hardware, wherein said floating point 
emulation algorithm comprises: 

utilizing floating point hardware on the target 
machine to perform calculations on the operands of the 
instructions when it is determined based upon the 
intermediate calculations that the target machine provides 
sufficient precision for the calculations required by the 
instructions; and 



utilizing integer hardware on the target machine to 
perform calculations not selected to be performed by the 
floating point hardware. 

13- The computer- readable storage medium of claim 12, 

wherein the program code instructions are accumulated 
instructions that are calculated at a higher precision 
than the operands capable of being handled by the target 
machine . 

14. The computer- readable storage medium of claim 13, 
wherein the program code instructions are floating point 
accumulated instructions of the form: d = ±(a*b ± c) , 
wherein a, b, c and d are operands expressible as floating 
point numbers . 

15. The computer- readable storage medium of claim 14, 
said computer- readable code further executable for 
identifying whether any of the operands (a, b, or c) are 
special values having a known result that all compatible 
hardware will produce regardless of the level of precision 
of said hardware . 

16. The computer- readable storage medium of claim 15, 
wherein said special values include either zero, infinity, 
or NaN (not a number) , wherein the floating point hardware 
is utilized to calculate the result of the accumulated 
instructions when any of the operands (a, b, or c) are 
identified as special values. 

17. The computer- readable storage medium of claim 14, 
wherein said floating point emulation algorithm further 
comprises : 
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determining whether the exponent for the result of the 
multiplication of (a*b) overlaps with the exponent of c; 
and 

utilizing the floating point hardware to calculate the 
result of the accumulated instructions when the exponent 
for the result of the multiplication of (a*b) fails to 
overlap with the exponent of c. 

18. The computer- readable storage medium of claim 17, 

wherein, when the exponent for the result of the 
multiplication of (a*b) overlaps with the exponent of c, 
said floating point emulation algorithm further comprises: 

determining whether the mantissa for the result of the 
multiplication {a*b) requires more mantissa bits than 
provided for by said floating point hardware; and 

utilizing the floating point hardware to calculate the 
result of the accumulated instructions when the result of 
the multiplication (a*b) does not require more mantissa 
bits than provided for by the floating point hardware. 

The computer- readable storage medium of claim 18, 
said floating point emulation algorithm further comprising 
computing the full calculation of a*b using the integer 
hardware when mantissa for , the result of the 
multiplication (a*b) requires more mantissa bits than 
3 0 provided for by the floating point hardware. 
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20. The computer -readable storage medium of claim 19, 
said floating point emulation algorithm further 
comprising : 

5 determining whether the final resulting mantissa , of 

the mantissa (a*b) - the mantissa (c) equals zero; 

utilizing the floating point hardware to make the 
result equal to zero when the resulting mantissa is equal 
10 to zero; and 

calculating the remaining parts of the calculation of 
a*b + c using the integer hardware when the final 
resulting mantissa is not equal to zero. 

15 

21. In combination: 

a target processor; and 

2 0 translator code . for performing high precision 

emulation of program code instructions for a subject 
machine on a target machine, said translator code 
comprising code executable by said target processor for 
performing the following steps: 

25 

determining if operands in instructions of the program 
code for the subject machine require a different precision 
than provided for by the target machine; and 

30 applying a floating point emulation algorithm to 

perform intermediate calculations on the operands of the 
instructions at a higher precision than the precision 
supported by the target machine . 
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22. The combination of claim 21, wherein the target 

machine includes floating point hardware and integer 
hardware, wherein said floating point emulation algorithm 
5 comprises: 



utilizing floating point hardware on the target 
machine to perform calculations on the operands of the 
instructions when it is determined based upon the 
intermediate calculations that the target machine provides 
sufficient precision for the calculations required by the 
instructions; and 



utilizing integer hardware on the target machine to 
15 perform calculations not selected to be performed by the 
floating point hardware. 

23. The combination of claim 22, wherein the program 

code instructions are accumulated instructions that are 
calculated at a higher precision than the operands capable 
of being handled by the target machine. 
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24. The combination of claim 23, wherein the program 
code instructions are floating point accumulated 

25 instructions of the form: d = ±(a*b ± c) , wherein a, b, c 
and d are operands expressible as floating point numbers. 

25. The combination of claim 24, wherein said floating 
point emulation algorithm comprises identifying whether 

30 any of the operands (a, b, or c) are special values having 
a known result that all compatible hardware will produce 
regardless of the level of precision of said hardware. 
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26. The combination of claim 25, wherein said special 
values include either zero, infinity, or NaN (not a 
number) , wherein the floating point hardware is utilized 
to calculate the result of the accumulated instructions 

5 when any of the operands (a, b, or c) are identified as 
special values . 

27. The combination of claim 24, wherein said floating 
point emulation algorithm further comprises: 

10 

determining whether the exponent for the result of the 
multiplication of (a*b) overlaps with the exponent of c; 
and 

15 utilizing the floating point hardware to calculate the 

result of the accumulated instructions when the exponent 
for the result of the multiplication of (a*b) fails to 
overlap with the exponenf^ of c. 

20 28. The combination of claim 27, wherein, when the 

exponent for the result of the multiplication of (a*b) 
overlaps with the exponent of c, said floating point 
emulation algorithm further comprises: 

25 determining whether the mantissa for the result of the 

multiplication (a*b) requires more mantissa bits than 
provided for by said floating point hardware; and 

utilizing the floating point hardware to calculate the 
30 result of the accumulated instructions when the result of 
the multiplication (a*b) does not require more mantissa 
bits than provided for by the floating point hardware. 



29. The combination of claim 28, said floating point 
emulation algorithm further comprising computing the full 
calculation of a*b using the integer hardware when 
mantissa for the result of the multiplication (a*b) 
requires . more mantissa bits than provided for by the 
floating point hardware. 

30. The combination of claim 29, said floating point 
emulation algorithm further comprising: 

determining whether the final resulting mantissa of 
the mantissa (a*b) - the mantissa (c) equals zero; 

utilizing the floating point hardware to make the 
result equal to zero when the resulting mantissa is equal 
to zero; and 



calculating the remaining parts of the calculation of 
a*b + c using the integer hardware when the final 
resulting mantissa is not equal to zero. 

31. A method of performing high precision emulation of 
program code instructions for a subject machine on a 
target machine, substantially as hereinbefore described 
with reference to the accompanying drawings. 

32. A computer- readable . storage medium having software 
resident thereon in the form of computer -readable code 
executable by a computer to perform the high precision 
emulation of program code instructions for a subject 
machine on a target machine, substantially as hereinbefore 
described with reference to the accompanying drawings. 



33 . In combination a target processor and 

translator code for performing high precision emulation of 
program code instructions for a subject machine on a 
target machine, said translator code comprising code 
executable by said target processor, substantially as 
hereinbefore described with reference to the accompanying 
drawings . 
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ABSTRACT 

METHOD AND APPARATUS FOR THE EMULATION OF 
5 HIGH PRECISION FLOATING POINT INSTRUCTIONS 

A high precision floating point emulator and 
associated method for emulating subject program code on a 

10 target machine where the subject machine base operands 
possess a different precision than the target machine. 
The high precision floating point emulator is provided for 
the emulation of subject program code instructions having 
a higher precision . than that supported by the target 

15 machine architecture by utilizing intermediate 
calculations' having values with a higher precision than 
that supported by the target machine. 
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